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Instructional Change? 
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The past ten years have witnessed an 
explosion in the use of interim assessments by 
school districts across the country. A primary 
reason for this rapid growth is the assumption 
that interim assessments can inform and 
improve instructional practice and thereby 
contribute to increased student achievement. 
Testing companies, states, and districts have 
become invested in selling or creating interim 
assessments and data management systems 
designed to help teachers, principals, and dis- 
trict leaders make sense of student data, iden- 
tify areas of strengths and weaknesses, identi- 
fy instructional strategies for targeted stu- 
dents, and much more. Districts are keeping 
their interim tests even under pressure to cut 
budgets (Sawchuk, 2009). The U.S. Depart- 
ment of Education is using its Race to the Top 
program to encourage school districts to 
develop formative or interim assessments as 
part of comprehensive state assessment 
systems. 

Much of the rhetoric around interim 
assessments paints a rosy picture. Supporters 
argue that these tests will provide data on stu- 
dent understanding; teachers’ analysis of this 
data will in turn lead to greater differentiation 
of instruction and better teaching of content, 
leading to improved student learning. Much 
of the belief in the potential of interim assess- 
ments comes from the body of research on 
formative assessment, particularly those stud- 
ies showing that “short-cycle” formative 
assessments — largely those that are based on 
information collected by teachers in their 
classrooms and that provide feedback to 
teachers within a single class period — are a 
powerful means to improve the quality of 
teaching and raise student performance 
(Black & Wiliam, 1998; Crooks, 1988; Hattie 
& Timperley, 2007; Natriello, 1987). 


However, this research has not focused on 
interim assessments, but rather on practices 
that are embedded within classroom instruc- 
tion. 

Very little research exists on how interim 
assessments are actually used, by individual 
teachers in classrooms, by principals, and by 
districts. Some recent studies surveyed teach- 
ers about their use of test data in instruction. 
Many of these teachers reported that interim 
test results helped them monitor student 
progress and identify skill gaps for their stu- 
dents, and led them to modify curriculum and 
instruction (c.f., Christman, et al., 2009; 
Clune & White, 2008; Stecher, et al., 2008) 
These studies, however, did not examine how 
individual teachers actually analyzed and 
used these data to inform their classroom 
practice, the policy conditions that supported 
teachers’ ability to use interim assessment 
data to improve instruction, or the interaction 
of interim assessments with other classroom 
assessment practices. Our study begins to fill 
that vacuum. 

The purpose of this exploratory study was 
to examine the use of interim assessments and 
the policy supports that promote their use to 
change instruction, focusing on elementary 
school mathematics. We use the term “interim 
assessments” to refer to assessments that: a) 
evaluate student knowledge and skills, typi- 
cally within a limited time frame; and b) the 
results of which can be easily aggregated and 
analyzed across classrooms, schools, or even 
districts (Perie, Marion, & Gong, 2009). Our 
study looked at how 45 elementary school 
teachers in a purposive sample of 9 schools in 
2 districts used interim assessments in mathe- 
matics in 2006-07. The study focused on 
teachers’ use of data in a cycle of instruction- 


December 2009 
RB-51 


Graduate School of 
Education 

University of 
Pennsylvania 


This Policy Brief was 
derived from an extensive 
research report titled, 

From Testing to Teaching: 
The Use of Interim 
Assessment in Classroom 
Instruction. 

Visit CPRE's website 
(www.cpre.org) to 
download a free copy. 


Consortium for Policy 
Research 
in Education 

University 
of Pennsylvania 

Teachers College 
Columbia University 

Harvard University 

Stanford University 

University of Michigan 

University of Wisconsin- 
Madison 

Northwestern University 




Policy Briefs 


al improvement; that is, how teachers gather 
or access evidence about student learning; 
analyze and interpret that evidence; use evi- 
dence to plan instruction; and carry out 
improved instruction. It also considered the 
many factors that influence how teachers 
access, manage, interpret, and act on data. 
These include district and school policies and 
practices, and organizational norms and rou- 
tines, as well as educator capacity. 

The two study districts — Philadelphia and 
Cumberland, Pennsylvania 1 — share a com- 
mon accountability context (i.e., the same 
state standards and state test), use the same 
elementary mathematics program, Everyday 
Mathematics (EM), and had adopted interim 
assessments in elementary mathematics. By 
selecting one urban and one suburban district, 
we sought to learn how policy supports for 
assessment and instructional improvement 
function in these different environments. 
Within each district, we chose schools that 
had made Adequate Yearly Progress but 
reflected a range of student performance as 
well as the ethnic and socio-economic diver- 
sity of the district. Seven of the nine study 
schools were Title I schools. In each site, we 
conducted interviews with district administra- 
tors, principals and instructional support staff. 
In each school, we observed and interviewed 
all 3 rd - and 5 th -gradc teachers, our focal 
grades. We collected information on how the 
teachers analyzed and acted on their interim 
assessment data and how they would respond 
to student errors on assessment items. We also 
conducted a short survey designed to measure 
participating teachers’ mathematical knowl- 
edge for teaching (Hill, Shilling, & Ball, 
2004). 

This policy brief reviews the study’s key 
findings regarding the policy supports that 
existed to support data use and teachers’ actu- 
al use of interim (and related formative) 
assessment data. It also presents implications 
for educators, policymakers, and researchers. 
Our findings highlight the potential and limi- 
tations of interim assessments for the four 
stages of the instructional improvement cycle. 


1 Cumberland is a pseudonym for the suburban 
district in the study. 


We conclude that interim assessments that are 
designed for instructional purposes are help- 
ful but not sufficient to inform instructional 
change. When well-supported by their dis- 
tricts and schools, teachers used interim 
assessment data to decide what to re-teach to 
and to whom, but not necessarily to change 
the ways in which they taught this content. 
Rather, teachers’ instructional and assessment 
practices appeared to be affected more by 
their capacity to understand their students’ 
mathematical learning than by the type of 
assessment (interim or formative) they used. 

District and School Supports 
for Assessment Use 

Philadelphia and Cumberland adminis- 
tered interim assessments roughly every six 
weeks in the elementary grades. These assess- 
ments were aligned to district curriculum and 
were designed to test only those concepts and 
objectives taught during that time period. As 
discussed below, teachers were given a period 
of time after receipt of assessment results to 
review and/or extend development of these 
topics. Both districts adopted policies and cre- 
ated conditions that were designed to support 
teacher use of the interim assessment data for 
instructional improvement. These included 
setting strong expectations for data use; gen- 
erating timely and accessible analyses of stu- 
dent performance data that could inform 
instruction; dedicating time to analyze data, 
plan instruction and re-teach students; and 
providing instructional support for teachers 
and students. 

Expectations for use. Perie, Marion, and 
Gong (2009) discuss three possible uses of 
interim assessment results: a) instructional — 
to help teachers adjust their instruction and 
curriculum to address student learning needs; 
b) evaluative — to help educators evaluate and 
improve instructional programs; and c) pre- 
dictive — to determine each student’s likeli- 
hood of achieving a performance level on an 
end-of-year test. The districts in our study 
established and communicated expectations 
for the instructional use of interim assessment 
data at all levels of the system. The districts 
viewed these assessments as “teaching tools” 
that would support and guide teachers’ 
instruction. District staff and school leaders 
(principals in Philadelphia and curriculum 
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specialists in Cumberland) expected teachers 
to use assessment results to reflect on their 
instruction, to discuss and share common 
problems and instructional solutions, and to 
provide remediation and enrichment during a 
dedicated period of time following the assess- 
ments. 

Both districts designed their interim 
assessments to be part of their overall instruc- 
tional guidance systems, not as “mini state 
tests’" that mirrored the items in the high- 
stakes state assessment. The districts adopted 
curriculum in mathematics aligned to state 
standards, adopted common mathematics pro- 
grams across schools, developed instructional 
timelines linked to units in these programs, 
and aligned interim assessment items with 
content of the district curriculum and materi- 
als for each instructional period. 

The districts communicated their expecta- 
tions to principals and teachers through sever- 
al mechanisms. First, they structured their 
information management systems in a way 
that focused the attention of teachers, instruc- 
tional support staff and, in Philadelphia, their 
principals on tested skills and learning stan- 
dards. Philadelphia shaped and reinforced 
their expectations for analysis of the interim 
assessment results by mandating the use of a 
data analysis protocol that asked teachers to 
identify the weakest skills and concepts for 
the instructional period and instructional 
strategies for re-teaching these skills and 
concepts. Cumberland developed interim 
assessments at the request of teachers, and 
teachers participated in their development, 
creating more of a “buy-in” for their use. 
Although test results were not made public in 
Cumberland, the district required all teachers 
to enter assessment results into an electronic 
spreadsheet. The format of the spreadsheet, 
which highlighted student performance by 
content sub-areas, generated expectations of 
when and where teachers should provide 
additional support to students. 

Second, Philadelphia held principals 
accountable for ensuring that teachers 
accessed, interpreted, and acted on the results 
of the interim assessments. Philadelphia prin- 
cipals were required to complete and review 
data analysis protocols for their schools with 
their regional superintendents and to analyze 
school-level results with other principals at 
monthly meetings. Although the intent of 


Our research was funded by a National Sci- 
ence Foundation grant (#REC -0529485) to the 
Consortium for Policy Research in Education 
(CPRE). Opinions expressed in this report are 
those of the authors and do not necessarily 
reflect the views of the National Science Foun- 
dation, the study districts, CPRE, or its institu- 
tional members. 

these meetings was to generate constructive 
dialogue about instructional improvement, 
some educators viewed the public sharing of 
data as undermining the low-stakes, instruc- 
tional focus of the interim assessments. In 
Cumberland, expectations for the use of the 
interim assessments were communicated 
through the district’s curriculum and instruc- 
tion staff, keeping the stakes of the interim 
tests low and maintaining the emphasis on 
instructional use. 

Finally, principals in our Philadelphia 
study schools reinforced the district’s expec- 
tations for data use by modeling and monitor- 
ing teachers’ analysis of the interim assess- 
ments. The principals conducted their own 
analysis of assessment results to identify 
struggling students; areas of weak skills with- 
in a grade level; teachers whose classes might 
be falling behind those of their grade-level 
colleagues; and/or subgroup performance. 
Principals reviewed teachers’ data analysis 
protocols and they discussed the results of the 
interim assessments with teachers in grade- 
group meetings. In some of our study schools, 
principals looked for evidence of the reported 
re-teaching strategies in teachers’ lesson 
plans. District- and school-based curriculum 
specialists played this role in Cumberland, 
identifying common problems within grades 
and across schools and discussing assessment 
results with teachers. 

Technology and data. The districts devel- 
oped “ user-friendly ” instructional manage- 
ment systems (IMS) that scored student 
responses and facilitated teachers’ analysis of 
interim assessment data. The IMS in Philadel- 
phia enabled a teacher to view the perfor- 
mance of each student and the entire class on 
individual test items and by content standard, 
all with a click of the mouse. The system also 
displayed each student’s incorrect answer 
choices. The Cumberland IMS was less 
sophisticated. After teachers entered test 
results, the system automatically highlighted 
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students who missed more than one test item 
for a learning standard. While teachers could 
not conduct item analyses using this IMS, 
they could easily see how many and which 
students were possibly weak in a particular 
learning area. The Philadelphia IMS also pro- 
vided links to information on how to re -teach 
a particular standard and practice worksheets 
for students. Philadelphia provided profes- 
sional development on its IMS, but the train- 
ing focused on how to access and use the 
components of the system (“point and click”), 
rather than on how to analyze the interim 
assessment data. 

Both districts assumed that their interim 
assessments, which were aligned with 
instructional units, would provide teachers 
with valid and actionable information; that is, 
teachers could diagnose student error and 
design appropriate re-teaching based on the 
results of the tests, including an analysis of 
incorrect answers. As will be discussed 
below, it is doubtful that these interim assess- 
ments fulfilled this goal. 

Time. Both districts created dedicated 
time for teachers to discuss assessment results 
and instructional techniques, to re -teach con- 
tent and skills to students, and to participate in 
professional development. Philadelphia cre- 
ated six-week cycles of instruction and 
assessment: five weeks of instruction (tied to 
the district’s pacing guide) culminating with 
the interim assessments, and a sixth week of 
review and/or extended development of top- 
ics. At the end of the sixth week, teachers 
moved on to the next instructional unit. The 
cycle was somewhat different in Cumberland, 
where teachers generally administered the 
interim assessments, or practice tests, 3 to 5 
days prior to giving a curriculum-embedded 
end-of-unit test that was part of a student’s 
grade. In both cases, the districts expected 
teachers (and where available, other support 
staff) to provide remediation for students in 
areas of weakness and enrichment in areas of 
strength during these re-teaching periods. 
While schools in both Philadelphia and Cum- 
berland created common planning time for 
teachers in the same grades, and set aside ded- 
icated time for professional development dur- 
ing the school day, student or school issues or 
district-directed professional development 
often limited time available for teachers to 
discuss interim assessment results and com- 
mon instructional challenges. 


Professional support. The districts iden- 
tified two targets for professional support: 
teachers and students. Each of our Philadel- 
phia schools had a school-based teacher 
leader (SBTL) whose job was to assist teach- 
ers with data analysis and instruction in math- 
ematics. In three schools this was a part-time 
position, while in two the SBTL was full- 
time. The SBTLs helped teachers analyze 
data and locate additional instructional mate- 
rials, but they had limited time to provide 
instructional support to teachers. In Philadel- 
phia, teachers occasionally had other adults in 
their classrooms (generally volunteers or stu- 
dent teachers) to instruct struggling students 
or work with groups of students, thereby 
enabling teachers to implement small-group 
instruction. Many of our Philadelphia teach- 
ers made time during lunch hours and before 
or after school to give students additional sup- 
port. In contrast, Cumberland had an exten- 
sive system of instructional support for teach- 
ers and students through a district mathemat- 
ics coach and full-time school-based elemen- 
tary curriculum specialists (ECS) who served 
students as well as faculty. Cumberland 
schools also had mathematics aides who 
worked directly with teachers. This staffing 
facilitated the use of group instruction in the 
classroom and provided more intensive reme- 
diation to students identified as needing addi- 
tional help. 

As we see below, the type and level of dis- 
trict and school supports were one, but not the 
only, factor affecting how teachers used inter- 
im assessments. Others included teacher 
analysis and interpretation of interim assess- 
ment data as well as teacher capacity to assess 
and learn from students’ mathematical under- 
standing. 

School-level Implementation 
of Interim Assessments 

When well supported by districts, interim 
assessments were implemented in schools in a 
manner consistent with district expectations; 
that is, we found that teachers were attempt- 
ing to use interim assessment results for 
instructional improvement. In Philadelphia, 
the majority of the teachers in our study had 
accessed their classroom-level interim assess- 
ment results using the IMS, and they had all 
visited the “item-level analysis” view that had 
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been featured in district and school profes- 
sional development sessions. Most teachers 
were comfortable accessing those results 
directly using the IMS, although a few teach- 
ers relied on others in their building to print 
data out for them. In Cumberland, all teachers 
were inputting their classroom interim assess- 
ment scores into that district’s IMS. Among 
our sample, interim assessments had become 
part of teachers’ regular work. 

We found, not surprisingly, that teachers 
used interim assessment results in the context 
of information from their own measures of 
student learning. Teachers rarely reported 
being surprised at the results of the interim 
assessments, and they mentioned that the 
interim assessments largely confirmed what 
they already knew about student learning in 
mathematics. These assessments did, howev- 
er, provide uniform data that could serve as 
the basis for cross-classroom conversation. In 
addition, receiving the interim assessment 
results led some teachers to follow up by col- 
lecting more assessment information directly 
from students. For example, several of our 
teachers questioned students on their respons- 
es to individual interim assessment items and 
then used this information to plan further 
instruction. 

Teacher Analysis and Interpretation 
of Interim Assessment Data 

In both Philadelphia and Cumberland, 
teachers in our sample used the data from 
interim assessments to identify weak content 
areas or struggling students within their class- 
es. We found that teachers in Philadelphia 
often set thresholds for student performance 
(a score below which instructional response 
would be warranted), and that these thresh- 
olds varied from school to school and often 
from teacher to teacher. Teachers in Cumber- 
land were much less likely to employ specific 
thresholds of performance in their review of 
interim assessment data. Rather, they were 
more likely to speak of these results in the 
context of other information from their own 
formative assessment practice. 

Teachers’ processes for interpreting inter- 
im assessment data with an eye toward 
instructional planning were influenced by a 
variety of factors, including their knowledge 
about specific students’ backgrounds or past 


performance, student performance in relation 
to their peers, district factors such as the 
scheduling of interim assessments relative to 
the pacing guide, or teacher perceptions about 
which mathematical content was especially 
challenging for students. Furthermore, the 
IMS in each district influenced the steps 
teachers took in analyzing interim assessment 
results. Although the design of the IMS high- 
lighted areas of weakness in student perfor- 
mance in both districts, the system in 
Philadelphia enabled teachers to easily link 
assessment items, curricular content, and 
standards. In contrast, results were presented 
in Cumberland’s IMS by content area, with 
references to specific item numbers. This 
required teachers to take an extra step in relat- 
ing results back to specific items, a step that 
fewer teachers took. Because much of the 
interest in interim assessments stems from the 
assumption that they can be used to improve 
teaching by providing data on students’ 
understanding, a central focus of the study 
was the degree to which such assessments 
provided teachers with information about stu- 
dents’ mathematical thinking. We explored 
this issue in three ways. First, we analyzed a 
sample of the assessments themselves, focus- 
ing on the extent to which the items provided 
teachers with actionable information about 
students ’mathematical understanding. Among 
the Philadelphia assessments, only 6 of the 20 
items in the 3 rd -grade assessment and 5 of the 
20 items in the 5 th -grade assessment con- 
tained a set of distractors reflecting multiple 
errors that typical students might have. Fur- 
ther, distractors on only 2 to 3 items contained 
information on mathematical misunderstand- 
ings, as opposed to other sources of error 
(e.g., reading the problem correctly). The 
majority of the 3 rd -grade Cumberland assess- 
ment items contained unclear expectations 
(e.g., directions requiring students to 
“explain,” without specifying what about the 
answer they are to explain), answer choices 
that failed to reveal typical student errors, or 
content that was simply “not very mathemati- 
cal”. The majority of the Cumberland 5 th - 
grade assessment items had either unclear 
instructions or a design that did not support 
diagnosis because the problem types were too 
mixed to allow for detection of misunder- 
standing across item types. 
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Second, we analyzed the ways in which 
teachers interpreted student error in a variety 
of contexts. We hypothesized that even if 
interim assessment items were of high quali- 
ty, use of these data for instructional change 
or improvement required teachers to have the 
capacity to infer students’ thinking or under- 
standing from their responses to particular 
items. To explore the extent to which this 
occurred, we identified specific items from 
the interim assessments where there were 
plausible mathematical explanations for most 
or all of the incorrect answers. We then pre- 
sented teachers with incorrect responses on 
these items, and prompted them to tell us 
what the student might have been thinking. In 
both districts, a wide range of responses were 
observed. In general, these responses fell 
along a continuum ranging from procedural 
explanations of student error to conceptual 2 
diagnoses. For example, presented with a 
problem in which a student incorrectly added 
fractions with different denominators, some 
teachers would describe the student’s failure 
to find a common denominator (procedural), 
while others were more likely to note that the 
student did not seem to understand that a 
denominator represents the number of parts 
into which the whole is divided (conceptual). 
Overall, Philadelphia teachers were more 
likely to engage in procedural diagnoses, 
while Cumberland teachers were more evenly 
distributed between procedural and conceptu- 
al explanations of student error. Therefore, 
student performance even on potentially high- 
quality assessment items led to a wide range 
of inferences made by teachers in our study. 

Third, we explored teachers’ use of inter- 
im assessments within the wider context of 
their formative assessment practice. 


2 In the mathematics education literature, pro- 
cedural knowledge has been largely defined as 
“how to” and conceptual knowledge has been 
defined as “why” (Hiebert & Lefevre, 1986). 
Without joining current debates on the relation- 
ship between the two knowledge types 
(Baroody, Fed, & lohnson, 2007; Rittle-John- 
son & Sigler, 1998), we adopt Hiebert and 
Lefevre's straightforward distinction as our def- 
inition for the purposes of this study. Of course, 
we acknowledge that both knowledge types are 
necessary for the development of mathematical 
competence. 


This included “short-cycle” practices such as 
question routines and use of individual white- 
boards or response cards, along with teacher- 
developed assessments such as homework 
tasks or selecting appropriate activities from 
the instructional materials. Using a combina- 
tion of interview and observation data, we 
constructed profiles of teachers focused on 
how they collected, interpreted, and acted on 
interim, short-cycle, and teacher-developed 
assessment information. Overall, we found 
that teachers who focused on students’ con- 
ceptual understanding using one type of 
assessment were more likely to do so for all 
types of assessment, including interim assess- 
ments. This suggests that analytic or diagnos- 
tic capacity underlies effective formative 
assessment, regardless of whether those 
assessments are embedded within instruction, 
developed by teachers, or externally 
designed. 

In addition to diagnosing student error, 
many teachers in both districts attributed low 
performance on interim assessments to other 
learning challenges. These included a list of 
possible causes, including, but not limited to, 
weak reading ability, difficulty maintaining 
attention, and low levels of English language 
proficiency. Teachers in both districts also 
offered contextual or external explanations 
for students’ struggles, such as lack of back- 
ground knowledge, as contributing to difficul- 
ties in comprehending word problems. These 
factors were generally perceived to be outside 
of the teacher’s or school’s realm of influ- 
ence. 

Instructional Response to Interim 
Assessment Data 

While teachers accessed and analyzed 
interim assessment data, we found that this 
information did not substantially change their 
instructional and assessment practice. Teach- 
ers used interim assessment results largely to 
decide what content to re-teach and to whom, 
but not to make fundamental changes in the 
way that this content or these students were 
taught. Teachers’ use of classroom-based for- 
mative assessment did not necessarily lead to 
changes in instructional strategies either. 

The interim assessment results informed 
decisions about whom to teach, and these 
decisions differed by district. In planning for 


Can Interim Assessments be used for Instructional Change? 


re-teaching time, the most common response 
among Philadelphia teachers was to revisit 
content using a combination of whole-group 
and small-group instruction with a smaller 
number of teachers adding peer-tutoring to 
this mix. In Cumberland, the results from the 
interim assessments were primarily used to 
inform ongoing flexible grouping based on 
students’ various (mis-) understandings. 
These different ways of organizing re -teach- 
ing may have reflected, to some extent, the 
types and levels of support available to teach- 
ers. Individual remediation during class time 
was rare among the Philadelphia teachers in 
our study, due in part to lack of classroom 
support for practices like conferencing. In 
Cumberland, the lowest performing students 
could be referred to the curriculum specialist 
for learning support. 

The interim assessment data influenced 
what teachers chose to re-teach, but not nec- 
essarily how to teach it. While we saw varia- 
tion among teachers’ responses within each 
district, teachers in Philadelphia largely 
emphasized re-teaching procedural steps in 
their instructional planning, perhaps reflect- 
ing their focus on students’ procedural errors. 
In classrooms, we observed that this approach 
often took the form of teachers reworking 
examples (often problems from the interim 
assessments) with either the whole class or 
with a small group. While teachers in 
Philadelphia used the IMS to help with data 
interpretation, use of the IMS for more com- 
plex tasks, such as generating supplemental 
assessments or identifying curriculum, was 
far less common. In Cumberland, we also 
observed and heard about procedurally orient- 
ed approaches, but many teachers mentioned 
that part of re-teaching involves re-teaching a 
mathematical concept as well. Some Cumber- 
land teachers made a distinction between re- 
teaching and “completely re-teaching,” where 
the former might include follow-up lessons or 
student-worked examples on the board and 
the latter referred to direct instruction on a 
concept or skill that was already taught. This 
distinction is an important one, as re-teaching 
that emphasizes “ritualized skills and applica- 
tions” is unlikely to lead to increased student 
learning (McMillan, 2010, p. 45). 

We also examined the degree to which 
instructional strategies adopted by teachers 
during re-teaching differed from those used to 
teach the same content originally. To analyze 


change in instructional strategy, we explored 
the different ways in which teachers respond- 
ed instructionally not only to interim assess- 
ment data, but to all classroom-based forma- 
tive assessment information (e.g., teacher- 
developed quizzes, questioning routines, 
etc.). Nearly all of the teachers acted on for- 
mative assessment information with organi- 
zational strategies, with about half using it 
primarily or only in this way. For these teach- 
ers, formative assessment information was 
used to determine: what content to re-teach, 
which students need additional support, 
whether and how students should be grouped 
during re-teaching, and when to move on to 
the next concept or topic. Fewer than half of 
the teachers in our sample employed instruc- 
tional change strategies — modifications in 
how they intended to re-teach specific content 
or students — in response to formative assess- 
ment information. Many of these teachers 
simply opted for teaching content “a different 
way,” or made greater use of manipulatives in 
the hope that an alternate presentation might 
help students to grasp material with which 
they had struggled. 

Teachers who assessed for conceptual 
understanding were far more likely to employ 
instructional change strategies than those who 
did not. Examples of these strategies included 
use of additional representations or models of 
mathematical concepts (e.g., the introduction 
of arrays for multiplication or set models for 
fractions) and connecting students’ prior 
knowledge to current learning goals (e.g., 
relating algorithms for double-digit subtrac- 
tion to triple-digit subtraction). We can 
hypothesize that assessing students’ mathe- 
matical understanding (and not simply suc- 
cess or failure with procedures) affords teach- 
ers better opportunities to assess students’ 
learning needs. Many times, these needs are 
inconsistent with past instructional approach- 
es. 

It is also possible that teachers who are 
able to assess for conceptual understanding 
are also more likely to have the capacity to 
respond with a varied instructional repertoire. 
When we conducted an in-depth examination 
of the instructional practices of a small subset 
of our teachers, we found that the instruction 
and formative assessment practice of teachers 
with high levels of mathematical knowledge 
for teaching (MKT) was generally centered 
around student understanding: the teachers 
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actively sought to learn how students think 
and they responded to student understand- 
ings. The level of mathematics in these rooms 
was relatively high, and student engagement 
was maintained. In contrast, the lack of small- 
group and individual student work in the 
rooms of teachers with lower levels of MKT, 
coupled with mathematically superficial 
questioning routines, was an obstacle to learn- 
ing about student understanding. While these 
findings are based on data from only a small 
number of teachers, they raise questions about 
the possible relationships between teachers’ 
knowledge of how to teach mathematics and 
their instructional practices. 

Implications for Policy 
and Research 

Research literature about the impact of 
interim assessments on student learning is at 
best inconclusive. Optimism about its poten- 
tial largely derives from research on short- 
cycle formative assessment, which has been 
shown to improve both instruction and stu- 
dent learning. The critical question for policy- 
makers, then, is whether interim assessments 
can be used formatively. Put another way, can 
teachers use interim assessment data to make 
instructional changes that are likely to 
improve student achievement? 

Our study showed that interim assess- 
ments are useful but not sufficient to inform 
instructional improvement. When linked 
directly to a district’s curriculum, interim 
assessments helped teachers make decisions 
about what content to re-teach and to whom 
by identifying areas in which specific stu- 
dents or the class as a whole were performing 
poorly. Where resources were available, inter- 
im assessments also allowed teachers to help 
students in need of additional, individualized 
supports. 

Use of interim assessments for these pur- 
poses was facilitated by several district and 
school factors, including alignment of interim 
assessment content with standards and cur- 
riculum; expectations that interim assessment 
results would be used to inform instruction; a 
quality and accessible IMS that focused 
teachers’ attention on content as well as on 
items; time to re-teach content and skills to 
students; and instructional supports for strug- 
gling students and professional supports for 


teachers in data analysis and instruction. 
School leadership and a culture of data use 
were also critical factors in supporting teach- 
ers’ use of data. 

We found little evidence, however, that the 
interim assessments we studied helped teach- 
ers develop a deeper understanding of stu- 
dents’ mathematical learning — a precursor to 
instructional improvement. Most items in the 
assessments did not provide actionable infor- 
mation on students’ misunderstandings. In 
addition, teachers’ capacity to interpret 
assessment data played a major role in how 
they used the results of interim, and even for- 
mative, assessment. Many teachers focused 
on procedural rather than conceptual sources 
of student errors on test items, diagnoses that 
appeared to inform their instructional plan- 
ning during re-teaching. Teachers who 
assessed for conceptual understanding were 
more likely to use instructional change strate- 
gies that those who did not. Teachers’ mathe- 
matical knowledge for teaching also appeared 
to contribute to teachers’ instructional and 
assessment practices. 

The findings from our study, along with 
those from related research on formative 
assessment and data-driven decision making, 
lead us to make the following recommenda- 
tions about the design of assessment systems, 
supporting the use of interim assessments, 
and future research. 

Focus, align, and inform. The design of 
interim assessments must reflect their intend- 
ed use. While this study focused on the ways 
in which teachers used interim assessments 
formatively (i.e., to change instruction), inter- 
im assessments can also have predictive 
and/or evaluative purposes. Assessments 
should be chosen to serve a single purpose. If 
interim assessments are to be used formative- 
ly, they must be designed for instructional 
purposes. This may mean using other tests to 
meet predictive or evaluative goals. Assess- 
ments designed for instructional puiposes 
must be closely aligned with district curricu- 
lum as well as district and state standards. 
This principle applies not only to the con- 
structs that are assessed and the formats of the 
test, but to any supplemental components of 
the assessment. For example, recommended 
instructional strategies need to align to the 
instructional approach of the curriculum. 
Similarly, districts need to verify claims that 
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multiple-choice item distractors carry instruc- 
tionally useful information. Mathematics 
items should be written so that distractors rep- 
resent common errors in both procedure and 
conceptual understanding. 

Support teachers and students. Even if 
interim assessments are focused, aligned with 
curriculum, and of high quality, their impact 
on teaching and learning depends on how 
their adoption and use is supported at the dis- 
trict and school level. District and school 
leaders need to communicate consistent and 
clear messages about the purpose and use of 
interim assessment. School leaders should 
model effective data use for teachers and 
other support staff and should allocate school- 
level resources to support interim assessment 
use for instructional purposes. 

District IMS must return interim assess- 
ment data to teachers in a manner that is both 
timely and accessible; teachers must in turn 
be trained to use the IMS to its full capabili- 
ties. The goal should be to have teachers 
invest their time in interpreting results and 
planning instruction rather than navigating 
the IMS or entering data. Another critical fac- 
tor is time. Whether highly structured or flex- 
ible, pacing schedules must allow time for re- 
teaching to occur. Additionally, teachers 
should have regular time in their schedules to 
analyze interim assessment results and dis- 
cuss potential instructional responses. 

While a major goal of interim assessment 
is to improve classroom instruction, our find- 
ings also suggest that a secondary use of such 
assessments may be to identify students in 
need of additional support, such as added 
instructional time or tutoring. Schools that 
already have these resources in place should 
consider using interim assessments (together 
with teacher input) to identify students in 
need of support. Where such supports are lim- 
ited, schools should consider how to best 
respond to individual students who continue 
to struggle. This is an urgent issue given the 
multiple demands placed on teachers during 
regularly scheduled instructional time. 

Build instructional capacity. Building a 
high-quality assessment system that is sup- 
ported at the district and school levels is nec- 
essary for teachers to access, analyze, and dis- 
cuss data. How well teachers use such data in 
the classroom, however, reflects their capaci- 


ty to assess and teach for mathematical under- 
standing. Teachers who assess for conceptual 
understanding do so across multiple test for- 
mats, and appear to be more apt to enact 
instructional change strategies than those who 
pay attention to students’ procedural skills 
alone. Likewise, formative assessment, as a 
process, is heavily dependent on teacher 
capacity. 

When looking to increase teacher capaci- 
ty to use data for instructional improvement, 
districts and schools should consider that 
teachers need more professional development 
and support on interpreting data (e.g., diag- 
nosing student error) and on connecting this 
evidence to specific instructional approaches 
and strategies. Therefore, professional devel- 
opment for interim assessment use should go 
beyond using “point and click” to locate and 
organize data and should emphasize analysis 
of student results in the context of standards 
and curriculum. Analysis should incorporate 
information from other types of assessment 
(e.g., in-class student work, teacher observa- 
tion, etc.). In addition, part of using evidence 
of student learning to improve instruction is 
knowing how mathematical understanding 
develops and how to support students’ 
progress toward a learning goal. Thus, in 
mathematics, professional development for 
teachers should focus as well on teacher con- 
tent knowledge, developing teachers’ instruc- 
tional repertoires, and capacity to assess for 
students’ mathematical learning. 

The curriculum must be designed to allow 
for integration of assessment information 
from multiple sources and provide guidance 
for instructional response. In some cases, the 
potential for this opportunity lies within the 
current curriculum; for example, the program 
used by our study districts offers multiple 
types of assessment embedded within the cur- 
riculum as well as instructional suggestions 
for remediation and enrichment built into 
every lesson. In other cases, more appropriate 
programs or supplemental materials may need 
to be adopted. In addition, tools are being 
developed to enable the connection between 
interpretation and action; for example, newer 
technology platforms aim to link information 
gleaned from assessments with potential 
instructional responses. 
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However, adopting the right curriculum 
and tools are not, in themselves, sufficient to 
enable teachers to adjust instruction in 
response to assessment results. An extended 
research base supports the value of regular, 
facilitated teachers’ analysis of student work 
to inform instructional decision-making. One 
such model features groups of teachers exam- 
ining student work in collaboration with a 
content area expert (e.g., mathematics coach 
or curriculum specialist) on a regular basis 
throughout the school year. Teachers return to 
their classrooms with a list of possible 
instructional strategies developed by the 
group. The next meeting begins with teach- 
ers’ reporting on the success and challenges of 
implementing instructional change. This 
information, along with new student work, 
forms the basis for the next discussion. It is 
this kind of ongoing, supported capacity- 
building that gives teachers the best chance at 
turning assessment results into increased stu- 
dent learning. 

Research implications. This was an 
exploratory study focused on how teachers 
actually interpret and act on data from interim 
assessments. Below we make suggestions for 
further developing the field of research on 
interim assessments. 

First, we see a need to develop a more 
comprehensive body of research that focuses 
on actual assessment use. We believe that the 
most potential lies in examining assessment 
use within particular content areas (e.g., read- 
ing, writing, mathematics, science, etc.). In 
this way, we can identify trends and relation- 
ships that exist within content areas as well as 
others that may apply more generally (e.g., 
the importance of timeliness of assessment 
results). Likewise, the role of teacher capaci- 
ty for teaching and assessing within particular 
content areas is an important variable to con- 
sider when researching teacher assessment 
use. 

Second, there needs to be research on the 
quality of data generated by interim assess- 
ments. This is a severely neglected area of 
research, yet poor data precludes effective 
data use. Claims about the validity of interim 
assessment results for instructional use need 
to be investigated as a matter of course. 

Finally, research on assessment should 
examine interim assessment use in the context 
of the broader system of assessment. Current 


research tends to focus on individual assess- 
ments and not on the relationship among 
assessments. There is a need to examine the 
degree to which assessments of different 
types inform each other. For example, do 
teachers scaffold the information received 
from different assessments? To what degree 
do the characteristics of these assessments 
influence teacher use? Answering these ques- 
tions necessitates observing the instruction 
that is part of the assessment cycle. 
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